#Open Model

6 articles

TechJun 28, 20268 min

DeepSeek-V4-Pro-DSpark is not a new model but a speculative-decoding V4-Pro

DeepSeek-V4-Pro-DSpark isn't a new base model. It's the same 1.6T V4-Pro checkpoint plus a DSpark speculative-decoding head (~893GB). What config.json and the DeepSpec repo reveal, and why there's no speed benchmark yet.

LLM DeepSeek Chinese AI MoE Inference Optimization Open Model Speculative Decoding

TechApr 27, 20267 min

LLaDA2.0-Uni Is an Open-Weight Diffusion LLM That Unifies Image Understanding and Generation

Inclusion AI released LLaDA2.0-Uni. A 16B MoE diffusion LLM that handles image understanding, 1024px image generation, image editing, and interleaved text-image generation in a single model.

AI LLM Image Generation VLM MoE Open Model Multimodal

TechApr 24, 2026updated11 min

DeepSeek V4 Preview specs: V4-Pro 1.6T and V4-Flash 284B open under MIT, 1M context, 27% inference FLOPs of V3.2

DeepSeek V4 Preview ships V4-Pro (1.6T/49B active) and V4-Flash (284B/13B active) as open weights under MIT, both with 1M context. CSA+HCA hybrid attention, mHC, and the Muon optimizer cut per-token FLOPs at 1M tokens to 27% of V3.2. Day-one API and chat.deepseek.com mode switch covered.

LLM DeepSeek Chinese AI MoE Open Model AI Agent

TechApr 24, 2026updated14 min

Tencent Hy3-preview (295B) vs Ant Ling-2.6-flash (104B): two open Chinese MoEs released the same week

Two open-weight Chinese MoEs landed within 24 hours: Ant Ling-2.6-flash (104B/7.4B active, 7x token-efficiency claim) and Tencent Hy3-preview (295B/21B active, frontier-tier open weights). Specs, licenses, and how they line up against DeepSeek-V3 and GLM-4.5.

LLM Chinese AI MoE Open Model AI Agent Local LLM OpenRouter

TechApr 8, 2026updated8 min

GLM-5.1 (Zhipu, 744B / 40B MoE, MIT): 58.4% SOTA on SWE-Bench Pro, 8h / 6,000+ tool calls without degradation

Zhipu AI's GLM-5.1 is a 744B MoE (40B active, 200K context, MIT) targeting long-horizon agent tasks. Hits 58.4% SOTA on SWE-Bench Pro (edging out GPT-5.4 and Claude Opus 4.6) and sustains performance across 8-hour sessions with 6,000+ tool calls without degradation.

AI LLM Chinese AI MoE Open Model AI Agent

TechApr 3, 2026updated21 min

Google's Gemma 4 launches in four sizes (E2B–A4B), publishing Gemini 3–derived reasoning under Apache 2.0

Google DeepMind has released Gemma 4: four models—31B dense, 26B MoE (A4B), E4B, and E2B—with a 256K context, multimodal input, tool calling, and support for 140 languages.

AI LLM Google Open Model MoE Multimodal Local LLM